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We analyze a simple model for growing tree networks and find that although it never percolates, 
there is an anomalously large cluster at finite size. We study the growth of both the maximal cluster 
and the cluster containing the original vertex and find that they obey power laws. This property is 
also observed through simulations in a non-linear model with loops and a true percolating phase. 



I. INTRODUCTION 

Recently there has been interest in the properties of 
random networks that are constructed by a growing pro- 
cess. These networks appear to model certain observed 
systems rather better than the random graphs of Erdos 
and Renyi [Q. Already, two reviews are available (|[[|. 

At first, interest concentrated on the degree distribu- 
tion. It has been noticed that networks such as the world 
wide web, the Internet backbone and scientific collabo- 
ration graphs have (at least in some range) a power law 
degree distribution. This is in contrast to the Poisson 
distribution found in random graphs. Barabasi and Al- 
bert |||5| noticed that a power law distribution could be 
obtained in a grown network with preferential attach- 
ment. 

The grown nature of the network creates correlations 
that affect more than the degree distribution. More re- 
cently, these other aspects of grown networks have been 
studied: in particular, the phenomenon of percolation, 
that was of great interest in the study of random graphs. 
Some grown models are devised to be fully connected and 
percolation can not be studied, but in a recent paper, 
Calloway et al. Q studied percolation in a very simple 
network growth model. Their model introduces a new 
vertex at every time step, and also, with probability d, 
makes a link between two existing vertices, chosen at 
random. Percolation in this model displays some inter- 
esting features that distinguish it from percolation in a 
corresponding random graph with the same degree distri- 
bution. For example, the location and order of the phase 
transition are modified by the correlations present in the 
grown model. A similar treatment of other models has 
been performed by Dorogovtsevei al. [Q, who note that 
below the percolation transition, the cluster size distri- 
bution has a power law dependence in contrast to the 
exponential dependence typical in non-grown models. 

In this paper we pursue the investigation of percolation 
in grown networks concentrating on models in which the 
new vertex introduced at each time step, is itself the end- 
point of the link possibly created in that time step. Mod- 
els of this type were in fact the original kind proposed by 
Barabasi and Albert jl| . Only later did Dorogovtsev and 
Mendes || introduce the other type of model in which 



vertex and link creation are decoupled, and which often 
happens to be more convenient for calculation. In gen- 
eral there are families of such models in which more than 
one link is added per time step. When two or more links 
are added there seems to be little to distinguish the sta- 
tistical features of the two families of models and they 
can be used interchangeably. However, for the partic- 
ular case of single link addition, the models do have a 
different character because in the case where the vertex 
is attached, only tree networks can be created. For this 
reason we call this model, that forms the basis of study 
in this paper, the "tree growth model" . 

Although most of the physical networks motivating the 
surge of interest in this subject are not treelike, other ex- 
amples, such as food webs, would appear not to contain, 
or at least to have a low probability of containing loops. 
However, a significant reason for studying a tree growth 
network comes from past experience: tree graphs have 
provided a fruitful field for investigating percolation in 
non-grown networks. They have been studied both in the 
Physics literature and through the mathematical field of 
branching processes. Tree models provide an infinite di- 
mensional or mean field model that is often tractable in a 
way that finite dimensional models are not. We shall find 
that the tree growth model that forms the basis of this 
paper is indeed a simple tractable model that illuminates 
more complicated scenarios. 

The most interesting feature that we shall use this 
model to expose is the power law growth of cluster size 
as the network size increases. In numerical work (and for 
the size of many practical networks) this feature, and 
the presence of what appear to be anomalously large 
clusters, mask the lack of strict percolation in the tree 
growth model. This is because in random graphs, clus- 
ter growth below the percolation threshold is only loga- 
rithmic. However, as was pointed out by Dorogovtsevei 
al. 0, the power law growth based on the underlying 
power law size distribution, makes the whole phase have 
scaling characteristics typical of critical behavior. 

We study the properties of clusters in depth and be- 
sides investigating the distribution of sizes of clusters cho- 
sen at random, we also study the size of the cluster con- 
taining the initial point. This illuminates the intuition 
that there is a highly connected "old core" that forms 
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the nucleus of the large clusters and turns out to give a 
useful analytic handle that is not so obvious in non-linear 
models with loops. To ensure that the phenomena we are 
studying are not an artifact of the tree model we intro- 
duce a non-linear extension and perform some numerical 
simulations. 

It is useful to contrast the properties of this grown tree 
model with a non-grown or static analog. In the present 
case, we argue that the appropriate analog is a branch- 
ing process rather than a random graph. Calloway et 
al. in their paper on percolation |(| , ascribe the cause of 
the differences between percolation on grown networks 
and random graphs to correlations between the degrees 
of vertices at each end of connecting links. We demon- 
strate that the tree growth model does not have any such 
correlations. 

The paper is organized as follows. After defining the 
tree growth model, the branching process we use as a 
static analog is introduced. The percolative and other 
properties of these models are then compared. The main 
results on the tree growth model are contained in the 
sections describing the growth of the maximal cluster and 
the cluster containing the origin. A calculation of the 
vertex degree correlations in this model is the subject of 
section [V. The final part of the paper concerns a non- 



attaching a new link to an existing vertex of degree k is 
dk/t, leading to the following evolution equations: 



linear generalization of the tree growth model, which is 
introduced and numerically simulated in order to confirm 
that the cluster growth properties observed in the tree 
model are preserved in more complicated models with 
loops. 



II. TREE GROWTH MODEL 

In each time step a new vertex is introduced. With 
probability 5, the new vertex is connected to another 
vertex, chosen at random from amongst the existing ver- 
tices. The vertex remains disconnected with probability 
1 — 5. In numerical simulations we always start with a 
single vertex at time t = 1, but we do not expect this 
initial condition to affect results at large times. 

This model only generates clusters of tree graphs. 
There is only a single tree cluster for the case 5 = 1. 
These clusters are fragile in the sense that single dele- 
tions will always destroy connectivity B. 



doit+l) = do(t) - 5^- + (1- 6) 
d 1 (t + l) = d 1 (t)-5^ + 6^ + 5 



(1) 
(2) 



4(Hl) = 4W-^ + ^, fc>2 (3) 

Note that the total number of vertices can be written 
as So° dk (*) = t ano - that the total expected number of 
links is given by | Ylo' kdk(t) = St. Since both quantities 
grow linearly in time we search for solutions of the form, 
dk(t) = Pkt, and find: 



Po = 



Pk 



1 + 5 VI 



k > 1 



(4) 
(5) 



This distribution decays exponentially in contrast to ran- 
dom graph models which have a Poisson degree distribu- 
tion, and the scale free models with power law distribu- 
tion B. 



B. Static Analog - Branching Process 

Before proceeding to investigate clustering issues we 
pause to introduce a non-grown or static analog of this 
model. The static model should have the same vertex 
degree distribution as the grown model, but should be 
constructed to avoid any correlation between the degree 
of linked vertices that might arise from the growing pro- 
cess. Furthermore, the analog should preserve the tree- 
like character of the model, so it cannot be one of the 
classic random graphs of Erdos and Renyi 0. An ap- 
propriate model is based on an ensemble of Galton and 
Watson branching processes fll| , |l2|| . 

A branching process may be regarded as a growth pro- 
cess in its own right, but each vertex is treated iden- 
tically, thus avoiding any potential correlation between 
vertex degrees. In order to reproduce the vertex degree 
distribution, we choose the probability of k offspring to 
be proportional to Pk+i in equations (HE), so: 



A. Degree Distribution 

We commence by investigating the distribution of the 
vertex degrees, that is, the number of links attached to 
a given vertex. Following the notation and methods of 
Callaway et al., we denote the expected number of ver- 
tices of degree k at time t by dk (t) . Since the total num- 
ber of vertices at time t is precisely t, the probability of 
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Pk = 



1 



5 



1 



k > 



(0) 



This choice gives the correct ratios of vertex degrees at all 
higher levels. However, at the first level, where no link is 
already present, it is not obvious that the choice correctly 
weights the vertices with no children at all. We return 
to this issue when we discuss the ensemble of branching 
processes. 
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The properties of the model are then a textbook ex- 
ercise JlH , but for completeness we summarize the main 
steps. The main concern is the with the cluster sizes, 
in particular the question of percolation. This approach 
based on branching processes is identical to the studies 
of percolation on trees, for example Bcthc lattices, which 
were popular in the 1980's JTo| and provided a mean field 
model for the percolation transition. 

Percolation occurs in this model when the extinction 
probability of the branching process is less than unity. 
This extinction probability may be calculated using the 
generating function for the probabilities (0): 



n1(x) - 
n2(x) 

n2y\yiicixi 
n3(x) 

n3static(x) - 
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1 + 5 - 8x 



(7) 



The extinction probability is given by the smallest root, 
Xq, of the equation: g(x) = x. This root is 1 for all 
values of 5 so percolation never takes place (though, in 
the same way as for one dimensional percolation, 5 = 1 
may be regarded as a critical point). 

The technique above can be extended to find the dis- 
tribution rii, of finite clusters in this model. For a sin- 
gle branching process, the generating function, p(x) = 
5~J^ nf x l , for the quantities nf , which are the probabil- 
ities that the process contains i nodes, is given by the 
solution to, p{x) = xg(p(x)), and is found to be: 



p(x) 



(1+*) 
25 



1 



-y^a+sr-^sx 



(8) 



The quantities nf may now be read off, however these 
are not the cluster numbers rij, as usually defined. The 
static model is an ensemble of branching processes, so 
nf corresponds to the number of clusters of size i per 
process, but rii is the number per node. To relate these 
quantities we compute the average number of nodes in 
a branching process as pi(l) = 1/(1 — S). In the limit 
of a large ensemble we then find rii = (1 — 5)nf . A 
proper discussion of the ensemble would allow a number 
of isolated nodes besides the clusters based on branching 
processes, in order to adjust the degree distribution. This 
more careful discussion leads to the same result. 



1 



Tlx 



n 2 



n 3 = 



1 + S 

5(1-5) 
{l + 5f 
26 2 {l-6) 
(1 + S) 5 



(9) 
(10) 

(11) 



A recursion relation may be obtained for higher order 
terms. These results are used for comparison with the 
tree growth model. 



FIG. 1. The fraction of vertices in clusters of size 1,2,3 
(ni,ri2,n3) accor ding to the formulae ( |l^Jl5| ) and also the 
static results Mndllll) . These predictions coincide for clusters 
of size 1. Simulation results lie on the exact curve but are not 
shown in this plot. 



C. Cluster Size Distribution 

The expected number of clusters of size i, Ni, in the 
tree growth model obey a set of evolution equations that 
can be obtained by noting that the probability of making 
a link to a cluster of i vertices is iNi/t. In contrast to the 
situation in more complicated models, these equations 
are linear, exact and hold for finite t. 



Ni(t + 1) = Nx(t) - S^P- + (1-5) 



(12) 



N i (t+l) = N i (t)-6^ + S^-^ Ni -^ (13) 



t 



t 



The expected total number of clusters Y^T Ni(t) grows 
linearly in time and is given by (1 — 5)t, since a new 
cluster is created whenever a link is not made in a time 
step. By summing the equations (weighted by i), we also 
find that the first moment is given by the total number of 
vertices, J^T iNi(t) = t. These relations also reflect the 
fact that each cluster is a tree graph, so the number of 
links is the number of vertices minus one. We search for 
the cluster size distribution, rii, of the form, iVj(t) = n,< 
and find the following recursion relations: 



1 



Ui = /-x , -n Ui - 1 
{to + 1) 



i > 2 



(14) 
(15) 



Although the first term, m, is (by design) the same as 
for the static model, later terms are different. Figure [j] 
shows the first few terms of the cluster size distribution 
for both the static model and the growth model. Notice 
that while the exact result is similar to the static one for 
small delta, it is smaller for larger delta. 
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Indeed, the large cluster behavior of the static and the 
growth model are completely different. The large cluster 
behavior of ( |l5| ) is power law: 



71^(2 + l/6)i 



-(1+1/6) 



(16) 



That of the branching model is dominated by exponential 
decay. 

The power law decay of the cluster distribution for 
the growing model has been noticed by Dorogovtsev et 
al. Jjj in the non-percolating phase of non-linear growth 
models. They have termed it a self organized critical 
state because the preferential attachment to larger clus- 
ters which causes the power law decay occurs automati- 
cally. For non-grown networks, exponential decay of the 
cluster numbers is a common feature (as in the static 
example). This difference has consequences for the way 
clusters grow. 



III. PERCOLATION 

Direct numerical simulations of the growing network 
indicate that for values of 8 larger than about 1/2, there 
is a cluster of size considerably larger than the others. 
This cluster often contains the original vertex and sug- 
gests that there may be a percolating cluster based on 
the "old core" of vertices that are created early in the 
growth. These numerical simulations are in fact mis- 
leading, but expose anomalous finite size effects that are 
studied below. 

An analytic approach to percolation does not take the 
usual route because the equations ([l2 13) are exact and 
hold for any incipient percolating cluster besides the fi- 
nite clusters. Ordinarily, the sum, on ly accounts 
for finite clusters and the infinite cluster must be added 
separately. However, according to the equations, this 
sum equals t and contains all the vertices, thereby leav- 
ing no room for an infinite cluster. The generating func- 
tion approach used in although pleasantly tractable, 
merely reproduces this information. 

Percolation does not occur in this model, except in the 
trivial limiting case 8 = 1 where the network just consists 
of a single tree graph. This phase diagram resembles that 
of ordinary one dimensional percolation. To understand 
the reasons why percolation does not take place, yet large 
clusters do appear at finite size, it is helpful to study the 
numerical data for the maximum sized cluster. This will 
then lead us to an investigation of the cluster containing 
the original site. 



Total number of vertices: t 



FIG. 2. Scaling of largest cluster size fraction against t for 
8 = 0.1,0.5,0.8 

suffers from an anomalously slow finite size effect, becom- 
ing smaller as the growth process is continued to larger 
times. For example, at (5 = 0.8, the fraction drops from 
about 0.29 at t = 10 3 , to 0.17 for a network 10 times 
larger. In figure ^ we show the largest cluster fraction 
against log(i) for various 8. The straight lines clearly 
indicate a power law dependence. The exponent can be 
determined by fitting, or by noticing that another plot of 
the same quantity (log) against 8 displays linear depen- 
dence. In any event, the lack of any transition is clear. 
The fit suggests the form: 



Fraction of vertices in largest cluster ~ t 



<5-l 



(17) 



This form of scaling behavior can be deduced from the 
original growth model. Consider a large isolated cluster, 
Nj = 1. By treating its size, i, as a continuous variable, 
we find that it grows according to the probability that a 
link will attach the new vertex to this cluster: 



i(t + 1) = i(t) + 8 



(18) 



There is no solution linear in t, but a form i ~ t s solves 
the equation in the large time limit. The fraction of sites 
in this largest cluster, i/t, therefore follows the scaling be- 
havior observed numerically in (|l7]) . As the system grows 
very large, the relative size of even the largest cluster de- 
creases and it is apparent that the tree growth model 
never experiences true percolation. 

In most static models with percolation, for example 
random graph models, the finite size scaling of the maxi- 
mum cluster size is given by log(t). This is related to the 
usual exponential decay of the cluster size distribution, 
and the power law behavior we see here follows from the 
distinctive decay (|l6|) in growth models. 



A. Numerical Study 



B. Cluster Containing the Initial Point 



On closer inspection of the numerical data it is found 
that the fraction of sites contained in the largest cluster 



The overall distribution of the sizes of randomly chosen 
clusters ( ^4] , ^5[ ) does not give any hint of the presence of 
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the large cluster seen in the numerical work above. It is 
hard to investigate the maximal cluster analytically, but 
if we rely on the observation that the maximal cluster is 
likely to be based on one of the oldest vertices, we may 
approach the problem from a different perspective. The 
distribution of the size of clusters that contain the origi- 
nal point is amenable to analytic methods and does shed 
some light on the presence of a large cluster. The possi- 
bility of studying this quantity is of course only available 
in grown networks that have distinguished vertices. 

As before, we start by writing evolution equations, this 
time for the probability Pi (t) that a distinguished cluster 
has size i at time t (1 < i < t). 



P 1 (t + l) = P 1 (t)-5 



Pi(t) 



(19) 



P i{ t + l)= m -8 i ^ + 6^¥tm (20 ) 



P t+ i(t + l) = 6P t (t) 



(21) 



These equations are very similar to the ones for the over- 
all cluster size distribution Ni(t) in ( |l2] , |l3| ), however, 
the difference in the first equation prevents any solution 
Pi(t) oc t. The equations actually hold for any distin- 
guished cluster, with the initial condition determining 
which cluster is selected. Simplest is to choose the cluster 
distinguished as containing the original point, in which 
case Pi(l) = 1. Other possibilities, for example the clus- 
ter containing the second point would be determined by 
the values at t = 2, P x (2) = 1 - 8 and P 2 (2) = 8. This in 
fact leads to the same distribution as for the first point, 
but a difference is obtained for the third point which 
is specified by: P x (3) =1-6, P 2 (3) = 5(1 - 8) and 
Ps(3) = 5 2 . In the following, we shall only consider the 
cluster containing the original point. 

The sum J2i Pfe(*) is preserved by these equations, and 
can be set to 1, as expected for a probability, by the initial 
condition. The average size of the distinguished cluster, 
k(t) = Yfi kP k (t), obeys k(t_+ 1) = (1 + S/t)k(t). So at 
large times we expect that k(t) ~ t 5 . This is essentially 
the same argument as in (|T^ ) of the last section and in- 
deed the evolution equation has the same intuitive origin. 
In this form the prefactor can be determined from the 
initial condition. Evolution equations for all the higher 
moments of the distribution will be considered below. 

For large t and k, the continuum version of the evolu- 
tion equation becomes: 



dP_ _ s d(kP) 
t ~dt^ ~dk~ 

which has a scaling solution, 

P(t,k)=t- 5 f(kr s ) 

where f(u) is any function. 



(22) 



(23) 




FIG. 3. Size distribution of the cluster containing the origin 
with a scaling plot of t s Pk(t) against t~ s k. For a variety of 
values of S. The lines are obtained by numerically solving the 
equations (^EoUill) up to t = 10 4 . 



This result is confirmed, and the form of the scaling 
function f(u) determined, by numerically solving the dif- 
ference equations (|l9| , po|,pl| ) and plotting them appropri- 
ately as shown in figure |3| No change in the form of the 
function is visible as t is increased beyond about 2000. 
A scaling relation of this form is interesting because it is 
found for all values of 8 not just those in the vicinity of 
the critical point at 8 = 1. 

The scaled cluster distribution shows a clear change in 
form around 5=1/2. Although the mean of the distribu- 
tion varies smoothly with <5, and is close to 1 on the scaled 
plot (corresponding to k(t) = t s before scaling), the mode 
moves away from zero (cluster size, k = 1, before rescal- 
ing) as 8 becomes greater than about 1/2. Eventually, as 
5 — ► 1 the scaling function becomes progressively more 
peaked around u = 1. This provides an argument for the 
likely presence of a large maximal cluster for 8 > 1/2. 

The form of the scaling function is not easy to deter- 
mine analytically. Only in the limit of large or small it, 
can f(u) be determined using the solutions for P\{t) and 
P t (t) obtained from @ and @. 

For comparison with simulations it is better to com- 
pare the moments of the distribution rather than the full 
form. The moments, defined as, 



S n (t) = Y J k n P k {t) 



(24) 



obey simple equations obtained from weighted sums of 

S (t+l)=S (t) (25) 
Si(t+l) = (l + |)Si(t) (26) 

Sa(t + l) = (l + y)5 a (t) + |si(t) (27) 
And similar equations for higher order moments. By 
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FIG. 4. First two moments of the size of cluster containing 
the origin. The upper curve is for the second moment, and the 
lower curve is for the mean. Each moment is scaled according 
to t~ nS S n (t) and based on simulations of 4000 samples of 
networks containing 10 4 vertices. The lines are the theoretical 
results including the sub-leading finite size terms. 



forming suitable linear combinations, these equations can 
be solved in terms of the following function: 



t-i 



R(z,t) = ]J(l + z/i) = 



i=l 



r(z + t) 
r(t)r(* + i) 



T(z + l) 



For example: 



So(t) =R(0,t) = 1 

S 1 (t)=R(6,t) 

S 2 (t) = 2R(2S,t) - R(6,t) 



(28) 
(29) 



(30) 

(31) 
(32) 



The general case is not difficult to work out, and it is 
also possible to treat clusters containing other than the 
original point. As t becomes large, R(nS,t) ~ t nS , so the 
leading term dominates and S n (t) — > n\R(nS,t). How- 
ever, for finite t, the sub-leading terms are large in the 
region 5 < l/log(t) and must be kept in numerical work. 

In figure we show comparisons of these formulae 
against simulation results for the mean and the second 
moment. Bearing in mind the scaling behavior, we plot 
each moment divided by a power of t s . These first mo- 
ments show excellent agreement. 



IV. VERTEX DEGREE CORRELATIONS 

To conclude our study of the tree growth model we 
follow the same argument used by Calloway et al. to 
determine the correlations between the vertex degree at 
each end of a randomly chosen link. The number of edges 
that join vertices of degrees j and k is denoted Ej k . This 
matrix is symmetric. For links that join vertices of the 



same degree, E k k is defined to be twice the number of 
such links. In this case exact evolution equations can be 
derived by treating the vertices with a single link spe- 
cially: 
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En(t) 
E lk (t) 



Eu(t 
E lk (t + 1) 

E jk (t + l)=E jk (t)+6[p k 
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(33) 
(34) 

(35) 



Where the dk(t) and pk are the vertex degree numbers 
and their probabilities as determined earlier in section 0. 

The total expected number of links is given by 
\ Sjfe Ejk(t) and the evolution equations show that it is 
given by St as anticipated. We therfore write the proba- 
bilities as Ejk(t) — 28tejk, and derive the following equa- 
tions. 



(l + 2<5)eii =p 
(1 + 26)e lk = + 6 ei k -i 
(1 + 2S)e jk = 5 (e.jk-1 + ej-ik) 



(36) 
(37) 
(38) 



By appropriately multiplying these equations and 
adding, we can find the following relations between the 
moments: 



Mr, = 



jk 



2_^3 e ik 

jk 



1 



= 1 



M 2 = ^2jke jk = l + 26 Mi + 



jk 



JPJ 



(39) 
(40) 
(41) 



Using the results of section || on the vertex degrees, we 
find the average degree k Pk — k — 26. The sum 
above includes vertices with no links, and the average 
degree on the end of a randomly chosen link is, fi = 

J2k 2 Pk /j: k Pk = l + 26- 

The covariance between vertex degrees at each end of 
a randomly chosen link is defined as: 



C 



3 k 



M 2 - 2/iM! + n 2 M 



(42) 
(43) 



Combining these results we find that C vanishes identi- 
cally and that there is no correlation between the degrees 
at the end of randomly chosen links in this model. This 
result is supported by simulations. 

In view of this result, it is slightly surprising that the 
analog static model which was specifically designed to 
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avoid these correlations, is not identical to the tree graph 
model. There is still a distinction as was apparent from 
the cluster numbers. 



V. TWO LINK GROWTH MODEL 

The preceding study of cluster growth in the tree 
growth network has been reasonably tractable, funda- 
mentally due to the tree property of the network. The 
question arises as to which features are preserved in more 
general models. 

The most obvious difference in more complicated mod- 
els is the presence of a percolating phase. The tree growth 
model has no percolating phase except the trivial one at 
8 = 1. The physical reason for this deficiency is not di- 
rectly the tree nature of the network. The cause should 
rather be sought in the growth itself. There is no mech- 
anism to attach existing clusters to each other. A mech- 
anism of this type was responsible for the percolating 
properties in the model of Calloway et ai, and clearly 
introduces non-linearities into the model, for example in 
the equation for the generating function of cluster sizes. 

A natural extension of our tree growth model is a 
model in which at each time step a new vertex is created 
and then, with probability S connected by two links to 
the existing vertices. Each new link is assigned a random 
terminating vertex amongst the existing vertices. The 
networks grown by this model are not necessarily tree- 
like and loops can form. The static analog is therefore a 
random graph rather than a branching process. Again, 
in numerical work, we use an initial condition consisting 
of a single vertex. A further generalization that we have 
considered is a model in which there is are fixed proba- 
bilities for single and double link connections. This leads 
to a more complicated phase diagram, but not to any 
significantly new observations. 

Below, we present the basic properties of the two link 
model closely following the methods of Calloway et ai. 
We then perform simulations to study the cluster prop- 
erties and compare them with what was found in the tree 
model. 



A. Two Link Growth Model - Degree Distribution 

The equations leading to the degree distribution that 
are obtained by the same means as for the tree growth 
model. 



do(t + l) = do(*)-25*^ + (l-J) 
d 1 (t + l) = d 1 (t)-28^ + 2S^- 
d 2 (t + l)=d 2 (t)- 28^ + 28^ + 8 (46) 



(44) 
(45) 



d k (t + 1) = d k (t) 28^ + 25^^L (47) 

Note that the total number of vertices, t, can be written 
as Y^q dk(t) and that the expected number of links, 2St, 
is given by \ kdk{t). Searching for solutions of the 
form, dk{t) = p k t, we find: 

1 -8 



Po 
Pi 



1 + 28 
28(1 - 8) 
(1 + 25) 2 



Pk = (1 + S8) 



2 k-2 8 k-l 

(1 + 28) k + 1 



(48) 
(49) 

(for k>2) (50) 



Again this distribution decays exponentially after the 
first couple of terms. 



B. Two Link Growth Model - Cluster Size 
Distribution 

The cluster sizes Ni in the two link growth model obey 
a set of evolution equations which are now approximate 
and only valid for finite clusters at large t since processes 
in which both links end in the same cluster are ignored. 
This is the same approximation that is made in the Cal- 
loway et ai analysis. 

JV 1 (t + l)=JVi(t)-2<5^^ + (l-5) (51) 



N 2 (t + 1) = N 2 (t) - 28 
N l {t + l) = N l {t)-28 



2N 2 {t) 
t 

iNi(t) 



i-2 . 



jN 3 (t) {i-j-l)N w {t) 



t 



t 



(52) 



(53) 



Solutions of the form Ni(t) = n^t are considered and a 
recursion relation obtained: 



1 + 2(5 

n 2 = 



i-2 



n t = 



I" ' l: "< i ' 



(54) 
(55) 

(56) 



Analysis of these cluster numbers is best carried out us- 
ing the generating function for the cluster sizes, g(x) = 
iriiX 1 , which obeys a non- linear equation: 



9 =Ts 



1 f 1 - 8 - g/x + Sg 1 



1 - xg 



(57) 
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delta = 0.2, maximal h 
delta = 0.2, original < 

delta = 0.1 , maximal > 
delta = 0.1, original : 



Total number of vertices: t 



FIG. 5. The fraction of the vertices contained within finite 
clusters. Obtained by numerical integration of the differential 
equation ^ using a step size of 10 -6 . 



C. Two Link Growth Model - Percolation 



FIG. 6. Maximal and original cluster fractions in the two 
link model for two values of 8, (0.1,0.2) within the perco- 
lating region. In each case the maximal curve is above the 
original one. Averages are taken over a large number of sam- 
ples ranging from 10 2 for the largest networks to 10 5 at the 
smallest. 



We compute g(l) by numerically integrating the equa- 
tion (57) starting from an initial condition (g(e) — nie). 
Figure |5j shows the results, and we recall that g(l) is 
the expected fraction of vertices contained in the finite 
clusters, so when it differs from 1, percolation occurs. 

The model percolates for most of the range of 5, but for 
a range of small <5 there is no percolation. It is possible 
to obtain the critical value <5 C by studying g'(l). In the 
percolating region g{\) < 1, so it is simple to take the 
x — ► 1 limit of the right hand side of equation (|5^) to 
obtain, 



</(!) 



6(1 + 9(1))) 



(58) 



In the case S < S c , g(l) — 1, and this limit must be 
taken more carefully with the help of L'Hopital's rule. 
The resulting quadratic equation can be solved to give: 



</(!) = 



1 - 45 ± Vl - 16<5+ 16<5 2 
45 



(59) 



We omit regions where the root is not real, and fur- 
ther require that it be positive. Finally, recognizing that 
g(l) — > 1 as <5 — > since in this limit all clusters have size 
one, we are able to pick the negative sign as being the 
only correct branch. 

In summary: d c = 1/2 - V3/4 w 0.06699. With g'(l) 
taking different values on each side: 



<?'(!) 



l-4r5-^l-16rS+l() < 'i T 

£ (1-1(1 +g(l))) 



for 5 < S c 
for S > S r 



(60) 



The critical behavior we have described is very simi- 
lar to that observed in the model studied by Calloway et 
al. By performing a similar investigation near the crit- 
ical point, we find the same signals of an infinite order 
transition with 1 - g'(l) ~ e a/v/7 ^. 



VI. CLUSTER GROWTH IN THE TWO LINK 
GROWTH MODEL 

In this section we describe the results of numerical sim- 
ulations to find how large clusters grow in this model. We 
track both the maximal cluster and the cluster containing 
the original vertex. 

In the region above the percolation threshold the max- 
imal cluster naturally grows with t. It is interesting to 
see how finite size affects influence this and how the clus- 
ter containing the original point grows. This is shown in 
figure ^ which indicates that there is a region where the 
original cluster is smaller than the maximal one, but as 
the size of the network increases, this cluster approaches 
the size of the maximal one. This result supports the 
intuition that the "old core" of vertices act as a seed for 
the percolating cluster. Indeed, the probability that the 
maximal cluster contains the original vertex appears to 
grow to 1 for any <5 in the percolating phase. Unfortu- 
nately the statistics for this analysis are not good for the 
sizes we have considered and this result should only be 
taken as suggestive. 

The finite size effects are most apparent for 5 = 0.1 
which is quite close to the critical point. In this case the 
fraction of sites in the either maximal or original cluster 
decrease with t in a way reminiscent of the behavior in 
the tree growth model. Estimates of a correlation size 
can be made on the basis of logarithmic plots which show 
a clear change in slope as the network size exceeds the 
correlation size at that value of 5. 

It is the situation below the percolation threshold that 
holds more interest for comparison with the tree growth 
model. In figure |7] we show evidence that the maximal 
cluster scales with a power law decay in this region. The 
original cluster behaves in the same way. This is exactly 
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delta = 0.05 r 
delta = 0.03 f 
delta = 0.01 i 



Total number of vertices: t 

FIG. 7. Scaling of maximal cluster in the two link model 
below the percolation threshold. The original cluster follows 
similar curves. For S = 0.01,0.03,0.05. Averages are taken 
over a large number of samples ranging from 50 for the largest 
networks to 10 4 for the smallest. 



as in the tree growth model, and as emphasized before, 
quite distinct from the log(i) behavior in random graph 
models. 

Having demonstrated that scaling occurs in the same 
way as in the tree growth model, we postpone any further 
study of the exponent of the growth. This is because of 
the difficulty of getting far from the critical point in this 
particular model. 
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VII. CONCLUSION 



The study of grown networks was originally motivated 
by real networks which are by nature finite. We have 
shown in a simple tree growth model, that although an 
infinite size system does not display percolation, finite 
systems of sizes that may have relevance to observations, 
often contain large clusters. These clusters grow with a 
power law dependence on the system size and provide 
another manifestation of the critical nature of the whole 
phase. The power law growth can be analysed carefully 
in this model, especially by studying clusters with dis- 
tinguished points, but the pattern of power law growth 
appears to be general as found in numerical simulations 
in a non-linear model. 
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